Filtering Reveals Form in Temporally Structured Displays
نویسنده
چکیده
In their report, Lee and Blake (1) asked whether the visual system could use temporal microstructure to bind image regions into unified objects, as has been proposed in some neural models (2). Lee and Blake presented two regions of dynamic texture. The elements of the target region changed in synchrony according to a random sequence, while the elements of the background region changed at independent times. The stimulus was designed in an attempt to remove all classical form-giving cues such as luminance, contrast, or motion, so that timing itself would provide the only cue. Subjects were readily able to distinguish the shape of the target region. Lee and Blake posited the existence of new visual mechanisms “exquisitely sensitive to the rich temporal structure contained in these highorder stochastic events.” The results have generated much excitement (3). However, we believe that the effects can be explained with well-known mechanisms. The filtering properties of early vision can convert the task into a simple static or dynamic texture discrimination problem. A sustained cell (temporal lowpass) will emphasize static texture through the mechanisms of visual persistence; a transient cell (temporal bandpass) will emphasize texture that is flickering or moving. We simulated a lowpass mechanism to see what would emerge. Lee and Blake’s stimuli were composed of randomly oriented Gabor elements, where the Gabor phase shifted forward or backward on each frame according to a coin-flip. We downloaded one such movie from their Web site and ran it through a temporal lowpass filter (4) (An input frame is shown in Fig. 1A; a filtered output frame is shown in Fig. 1B.). At the particular moment shown, the target region has a lower effective contrast than does the background, providing a strong form cue. At other moments the target’s contrast may be above or below the background’s contrast because of statistical fluctuations in the reversal sequences. If a single Gabor element happens to have a run of multiple shifts in one direction, its effective contrast is low because of the temporal averaging. Conversely, if it has a run of alternating forward and backward shifts, thus “jittering” in place, its contrast remains fairly high. Within the unsynchronized background the local contrasts fluctuate randomly, but within the synchronized target region they all rise and fall in unison, revealing a distinct rectangular form. In a second experiment Lee and Blake synchronized both the target and background region, each to its own random sequence. Here the target was even more clearly visible. This result is predicted by our hypothesis. Since both background and target are synchronized, they will both yield uniform texture contrasts after temporal filtering. There will be moments when, by chance, one region’s contrast is high while the other’s is low, and the target will become especially clear. Figure 1C shows one such moment, again the result of filtering a movie from the Web site with the lowpass filter. Figure 1D shows a moment when the relative contrasts are reversed. We also ran movies through a temporal bandpass filter (5) with a biphasic impulse response, to simulate a transient mechanism. Again, the target was clearly revealed. Our hypothesis also predicts, with the use of either filter, Lee and Blake’s finding that discrimination will be best when the reversal sequences have high entropy, that is, when the coin-flip is unbiased. The contrast cue is best when the target “jitters” in place while the background has a run in a single direction (or vice versa). This condition happens most frequently at high entropy. Lee and Blake’s stimuli are designed to remove form cues from single frames and from frame pairs. However, when one considers the full sequence, strong contrast cues can emerge due to the spatio-temporal filtering present in early vision. These cues probably suffice to explain the perception of form in the experiments. We do not see the need to posit mechanisms other than those already known to exist. Edward H. Adelson Hany Farid Department of Brain and Cognitive Sciences Massachusetts Institute of Technology Cambridge, MA 02139, U.S.A. E-mail: adelson,[email protected]
منابع مشابه
Role of gamma-band synchronization in priming of form discrimination for multiobject displays.
Previous research has shown that synchronized flicker can facilitate detection of a single Kanizsa square. The present study investigated the role of temporally structured priming in discrimination tasks involving perceptual relations between multiple Kanizsa-type figures. Results indicate that visual information presented as temporally structured flicker in the gamma band can modulate the perc...
متن کاملSolute Transport for Pulse Type Input Point Source along Temporally and Spatially Dependent Flow
In the present study, analytical solutions are obtained for two-dimensional advection dispersion equation for conservative solute transport in a semi-infinite heterogeneous porous medium with pulse type input point source of uniform nature. The change in dispersion parameter due to heterogeneity is considered as linear multiple of spatially dependent function and seepage velocity whereas seepag...
متن کاملSolute Transport for Pulse Type Input Point Source along Temporally and Spatially Dependent Flow
In the present study, analytical solutions are obtained for two-dimensional advection dispersion equation for conservative solute transport in a semi-infinite heterogeneous porous medium with pulse type input point source of uniform nature. The change in dispersion parameter due to heterogeneity is considered as linear multiple of spatially dependent function and seepage velocity whereas seepag...
متن کاملNavigation strategies with ecological displays
Ecological interface design (EID) has shown success as an approach for interface design in the case of a process control microworld. However, in applying the EID approach to larger systems, questions arise as how to support the navigation and integration of abstract information. In this study, three ecological displays were developed for a simulated power plant from the same abstraction hierarc...
متن کاملFigure–ground segregation can rely on differences in motion direction
If the elements within a figure move synchronously while those in the surround move at a different time, the figure is easily segregated from the surround and thus perceived. Lee and Blake (1999) [Visual form created solely from temporal structure. Science, 284, 1165-1168] demonstrated that this figure-ground separation may be based not only on time differences between motion onsets, but also o...
متن کاملWinter is coming: How humans forage in a temporally structured environment.
Much is known about visual search for single targets, but relatively little about how participants "forage" for multiple targets. One important question is how long participants will search before moving to a new display. Evidence suggests that participants should leave when intake drops below the average rate ("optimal foraging," Charnov, 1976). However, the real world has temporal structure (...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1999